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ABSTRACT 

Recently, it has been found that the field traced by QSO's Lya forests is intermittent on 
small scales. Intermittent behavior is essential for understanding the statistics and dynamics of 
cosmic gravitational clustering in the nonlinear regime. The most efi'ective method of describing 
intermittency uses the structure functions and the intermittent exponent, which measure the 
scale- and order-dependencies of the ratio between the higher order to second order moments of 
the field. These properties can be used not only to confirm the non-gaussianity of fields, but also 
to detect the type of non-gaussianity. 

In this paper, we calculate the structure function and intermittent exponent of 1.) Keck 
data, which consists of 28 high resolution, high signal to noise ratio (S/N) QSO Lya absorption 
spectra, and 2.) Lya forest simulation samples produced via the pseudo hydro scheme for the 
low density cold dark matter (LCDM) model and warm dark matter (WDM) model with particle 
mass mw = 300, 600, 800 and 1000 eV. Aside from the WDM model with mw = 300 eV, the 
simulation samples are in agreement with observations in the context of the power spectrum. 
We find, however, that the intermittent behavior of all the simulation samples is substantially 
inconsistent, both quantitatively and qualitatively, with the Keck data. Specifically, 1.) the 
structure functions of the simulation samples are significantly larger than that of Keck data on 
scales k > 0.1 km^^ s, 2.) the intermittent exponent of the simulation samples is more negative 
than that of Keck data on all redshifts considered, 3.) the order-dependence of the structure 
functions of simulation samples are closer to the intermittency of hierarchical clustering on all 
scales, while the Keck data are closer to a lognormal field on small scales. These differences 
are independent of noise and show that the intermittent evolution modeled by the pseudo-hydro 
simulation is substantially different from observations, even though they are in good agreement 
with each other in terms of second and lower order statistics. This result also shows that "weakly" 
clustered samples, like high resolution Lya absorption spectrum, are effective in testing dynamical 
models of structure formation if their intermittent features are considered. 

Subject headings: cosmology: theory - large-scale structure of the universe 
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1. Introduction 

The non-gaussian features of the QSO's Lya 
transmitted flux were recently studied and showed 
that the cosmic mass field traced by the QSO's 
Lya forests is intermittent on small scales. That 
is, the probability distribution functions (PDF) of 
the transmitted fiux fluctuations were found to 
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be substantially long tailed on scales less than 
~ 1 Mpc. The spatial distribution of the 
power of the transmission fluctuations was found 
to be spiky, i.e., the power was concentrated in 
rare modes while most modes showed very low 
power (Jamkhedkar, Zhan and Fang 2000; Zhan, 
Jamkhedkar and Fang 2001; Feng, Pando and 
Fang 2001). 

The impact of intermittency on observational 
cosmology is that the power spectrum is no longer 
effective for testing models when intermittency is 
substantial. Although spikes in the intermittent 
field are rare and improbable events, one cannot 
neglect the spikes in measuring the power spec- 
trum as almost all power of the transmission fluc- 
tuations is concentrated in these events. In such a 
case, the power spectrum is dominated by rare and 
improbable events (spikes). An intermittent field 
is statistically homogeneous. Yet, the rare events 
lead to significant differences among samples from 
different parts of the universe when the spatial size 
of the region is not large enough to contain nu- 
merous spikes. The precision of determining the 
power spectra is intrinsically constrained by in- 
termittency. The uncertainty of the power spec- 
trum for an intermittent field is inevitably large 
(Jamkhedkar et al 2002, hereafter Paper I). 

Therefore, in the nonlinear regime, more ef- 
fective measures to compare the predictions of 
models and observations must be used. Although 
an intermittent field is non-gaussian, many non- 
gaussian statistics are ineffective or insensitive to 
intermittency. This is because we need not only 
techniques for confirming the non-gaussianity of a 
field, but also methods which can tell the type of 
non-gaussianity. The most effective measures sen- 
sitive to the type of intermittency are the struc- 
ture functions and the intermittent exponent (e.g. 
Shraiman & Siggia 2000). 

In this paper, using a set of 28 high resolu- 
tion, high signal to noise ratio (S/N) QSO Lya 
absorption spectra (Keck data), we will carry out a 
systematic investigation of the structure functions 
and the intermittent exponent of the transmitted 
fiux. To show the usefulness of the structure func- 
tion and intermittent exponent as model discrim- 
inators, we also study the intermittent behaviors 
of Lya forest samples produced by a pseudo-hydro 
simulation technique for the LCDM and WDM 
models. These models are among the best that 



match the observed power spectrum and other 
lower than second order statistics. We will show 
that the simulation samples are significantly inter- 
mittent. However, both the quantitative and qual- 
itative properties of simulation samples are found 
to be substantially different from the Keck data. 

The paper is organized as follows. §2 addresses 
the method for measuring intermittent random 
fields. §3 presents the algorithm for calculating the 
struc;ture functions and intermittent exponent. §4 
describes, both the observed and simulated sam- 
ples, that we will use. The power spectrum and 
other lower order statistics of the Keck data and 
simulation samples are given in §5. §6 contains 
the important result, the structure function, in- 
termittent exponent and their of order-, redshift- 
and scale-dependencies for the Keck data and sim- 
ulation samples. The emphasis is on revealing the 
deviation of the intermittent behavior for the sim- 
ulation samples from the Keck data. Finally, the 
conclusions and discussions are in §7. 

2. Statistical description of an intermit- 
tent random field 

2.1. Basic properties of an intermittent 
field 

Physically, intermittency is used to character- 
ize a special type of random field in which struc- 
tures are essentially strong enhancements, peaks 
or spikes, randomly and widely scattered in space 
and/or time, with a low field value between the 
spikes. The spike-gap-spike feature is more pro- 
nounced on smaller scales. That is, compared with 
gaussian fields, the probability distribution func- 
tion (PDF) of the fiuctuations possesses a high 
peak around zero and possibly a long-tail. The 
long-tail events correspond to the rare and im- 
probable high peaks. This feature was originally 
found in the temperature and velocity distribu- 
tions in turbulence (Batchelor & Townsend 1949). 

Mathematically, an intermittent random den- 
sity field p{x) is defined by the ratio between the 
high- and low-order moments of the field 

{[p{x + r)-p{x)]^^) ^ / r\C(n) 
[{[p{x + r)-p{xm]-^\L/ ' 

where L is the size of the sample, and the exponent 
C can be function of n and r. If ( is negative for 
small r, the ratio eq.(l) is divergent with r — > 
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0. This corresponds to an intermittent field. An 
intermittent field is characterized by the n- and 
r-dependencies of exponent C- 

These properties make characterizing intermit- 
tent fields by traditional statistical measures diffi- 
cult. The power spectrum and two-point correla- 
tion function are unable to quantify intermittency. 
Two fields having the same power spectrum may 
have very different tails in their PDF. Very dif- 
ferent from the linear regime, the power spectrum 
will on longer be a critical discriminator among 
models of structure formation for an intermittent 
field. 

Equally ineffective arc any individual higher 
order correlation functions and higher order mo- 
ments of the density fluctuations S{x) = [p{x) — 
p\/p. Three- and four-point correlation functions 
of the density perturbation 5{x) are very useful 
for distinguishing a gaussian from a non-gaussian 
field. However, they are insensitive to the dif- 
ference between an intermittent field and a non- 
gaussian, but non-intermittent field. Moreover, 
eq.(l) means that the PDF of the density fluc- 
tuations [p{x + r) — p{x)] cannot be expanded 
into a series of moments of [p{x + r) — p{x)], 
as it does not converge. In this case, a bet- 
ter measure is not the divergent quantity itself, 
{[p{x -h r) - p{x)f'^) / {[p{x + r)- p{x)f)'^ when 
r ^ 0, but the quantity describing the divergent 
behavior, specifically, the exponent C,. 

2.2. Definition of the intermittent expo- 
nent 

Let us define density difference Ar(x) = p{x -\- 
r) — p{x). The density difference is equal to the 
density contrast difference, 6{x + r) — S{x), if the 
mean density p is normalized to 1. Intermittency 
is now defined by the divergence of the ratio be- 
tween the high and low order moments |Ar(a;)|". 
For a gaussian field, the statistical properties of a 
field p{x) and its difference p{x + r) — p{x) are the 
same, while they are different for an intermittent 
field. 

The ensemble average of the moment |Ar(a;)|^" 

is 

5^ = (|A.(x)p") (2) 

where n is a positive integer. If the field is homo- 
geneous, S^" is independent of x and depends only 
on r. 5^" is called the structure function. When 



the "fair sample hypothesis" is applicable (Peebles 
1980), the structure function can be calculated as 
the spatial average 

= 1 y \Ar{x)\'"dx, (3) 

where L is the spatial range of the sample. When 
n = 1, we have 

= (|A,(ar)|2). (4) 

is the mean of the square of the density fluctua- 
tions at wavcnumbcr k ~ 27r/r, and therefore, the 
r-dependence of actually is a different version 
of the ordinary power spectrum of the field. 

The intermittent exponent ( is defined by^ 

The intermittency of the field is effectively mea- 
sured by the structure function and the intermit- 
tent exponent. 

For an intermittent field, the ratio of 5^" to 

[5*2]" is larger for smaller r, and therefore, the ex- 
ponent ( is negative. The condition 5^" 3> [S^]" 
on small scales r <^ L indicates that the field con- 
tains "abnormal" events of large density fluctua- 
tions |Ar(a;)| on scale r. Thus, the more negative 
the exponent ( on smaller scales, the stronger the 
"abnormal" events on smaller scales. This gives 
rise to the spiky structure of an intermittent fleld. 
Generally speaking, the intermittent exponent ( 
measures the smoothness of the field: for positive 
(, the field is smoother on smaller scales. If ( is 
negative, the field is rough on small scales, and 
can even be singular. 

2.3. Examples of the intermittent expo- 
nent 

1. A Gaussian field. 

If the homogeneous and isotropic field is gaus- 
sian, the structure function eq.(2) is 

/oo 
Pg[Ar{x)]\Ar{x)\^^dAr{x) (6) 
-OO 

= (2n-l)!![5,T, 

^In turbulence, S^"/[5^]" is used to define the so-called 
anomalous scaling (Shraiman & Siggia 2000.) 
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where Pg is the gaussian PDF of Ar{x). Thus, if 
the mass density field is gaussian, we have 



2n 



(2n- 1)!!. 



(7) 



This ratio is independent of scale r, and therefore, 
the intermittent exponent C = 0- 
2. A self-similar field 

For a self-similar field, the average of Ar{x) on 
different scales satisfies 



A,(.x)|^") = A^"''(|Aa.(x)|^") 



(8) 



where A gives the scale factor from r to Ar, and h 
is the self-similar index. In this case, we have 



Qzn 

Or — 



2nh 



and therefore 



Q2n Q2n 



'21 n 



(9) 



(10) 



Therefore, is independent of r. We have 

then C = and self-similar fields are not intermit- 
tent. 

3. Hierarchical clustering. 

Hierarchical clustering is popular in modeling 
the nonlinear clustering of galaxies. This scheme 
assumes that the correlation functions of the mass 
density can be described by the linked-pair ap- 
proximation, i.e., the n-th irreducible correlation 
function ^„ is given by the two-point correlation 
function ^2 as ^„ = Qn^""^, where the hierarchi- 
cal coefficient Qn is constant (White 1979). 

It has been shown that a hierarchical clustered 
field is (Feng, Pando & Fang 2001) 



^2n 



2]n 



DC 



(l) 



-(d-K)(n-l) 



(11) 



where d is the spatial dimension, and the coeffi- 
cient K is a constant depending on the power law 
index of the power spectrum. In the case where 
K < d, the field is intermittent with exponent 



C~-(d-K)(n-l). 



(12) 



This is the simplest type of intermittency - 
a mono-fractal with fractal dimension k in d- 
dimensional space. A phenomenological model 



with hierarchical relations was developed by 
Soneira & Peebles (1977). The model is essen- 
tially the same as the so-called /3-model of the 
intermittency in turbulence where a dimension k 
fractal distribution in d-dimensional space has the 
exponent C is given by eq.(12) (Frisch 1995.) 
4. A Lognormal field 

Since the time of Hubble, the lognormal distri- 
bution has been used to model the PDFs of the 
cosmic mass and velocity field, such as the one- 
point distribution of the number of galaxies, veloc- 
ity difference, angular momentum, etc (e.g. Yang 
et al. 2001b). The lognormal model of the bary- 
onic matter distribution (Bi & Davidson, 1997; 
Feng & Fang 2000) has also been found to be in 
good agreement with all observed properties of the 
Lya forests. 

For a lognormal field, the PDF of Ar{x) is given 

by 
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.(a;)|c7(r) 



exp 



1 / In \Ar{x)\ - In \Arix) 



a{r) 



(13) 

where the variance a{r) of In | Ar(x) | can be a func- 
tion of the scale r. With eq.(13), we have (Van- 
marcke 1983) 



<?2n 

•^r _ g2(n^-n)o-^(r) 



(14) 



Using eq.(5), the intermittent exponent of a log- 
normal field is 



C ^2{n^ -n)a^{r)/ln{r/L). 



(15) 



When r < L,(is negative. Therefore, a lognormal 
field is intermittent. 

To summarize, the structure function and in- 
termittent exponent provide a complete and uni- 
fied description of intermittent fields. The n- and 
r-dependencies of the structure functions and in- 
termittent exponent C are sensitive to the details 
of the intermittency of the field. These mea- 
sures are very powerful for distinguishing among 
fields that are gaussian, self-similar, mono-fractal, 
multi-fractal, and singular. 
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3. The intermittent exponent 

3.1. Statistical variables in the DWT rep- 
resentation 

The quantity Ar{x) or [S{x + r) ~6(x)] contains 
two variables: the position x and the scale r, and 
therefore, d{x) — d{x + r) should be calculated with 
a space-scale decomposition. Moreover, as defined 
by eq.(2), S^" is sometimes a diverging quantity. 
For instance, (S^) = 2[{d{x)5{x)) - {S{x)S{x + r))] 
is divergent if the 2-point correlation function 
{S{x)6(x + r)) ~ r^". This problem can be han- 
dled in the same way as N-point correlation func- 
tions are handled, i.e., smoothing the density field 
at small scale prior to its measurement. How- 
ever, the size of the smoothing scale is put in by 
hand and might induce uncertainties, especially in 
studying the scaling behavior on scale comparable 
with the smoothing size. 

Ideally one would like a space-scale decomposi- 
tion without the need for pre-smoothing by hand. 
This leads us to chose the discrete wavelet trans- 
form (DWT). The DWT performs a smoothing 
"automatically" as the density difference [S{x) — 
d{x + r)] on scale r is calculated by a difference 
between the smoothed densities at spatial ranges 
{x, X + r/2) and {x + r/2, x + r). In wavelet anal- 
ysis, the smoothing is done scale-by-scale. More- 
over, the bases that the DWT uses for smooth- 
ing and decomposing distributions are orthogonal 
and complete. The smoothing and decomposition 
are optimized and there is no loss of information. 
Furthermore, no mixing of the density fluctuations 
among different scales occurs. This property is ex- 
cellent for studying scaling. 

Here we only very briefly introduce the DWT 
decomposition, as it has been introduced in our 
previous publications (e.g. Pando & Fang 1996; 
Pando & Fang 1998; Fang & Feng 2000). For de- 
tafls on the DWT refer to Mallat (1989a,b); Meyer 
(1992); Daubechies, (1992), and for physical appli- 
cations, refer to Fang & Thews (1998). 

We restrict our discussion to a 1-D random field 
of the transmission fiux F{x) extending in a spa- 
tial or redshift range L = X2 — xi. To apply the 
DWT, we first chop the spatial range L = X2 — xi 
of the 1-D sample into 2^ subintervals labeled with 
I = 0, ...2^ — 1 where j is a positive integer. Each 
subinterval spans a spatial range L/2^. The subin- 



terval / is from xi+Ll/2^ to a;i+i(/ + l)/2^. That 
is, we decompose the space L into cells {j, I) , where 
j denotes the scale L/2^ , and I the spatial range 
{xi+Ll/2^,xi + L{l + l)/2^). CcU (jj) is localized 
in scale space and physical (or redshift) space. 

Corresponding to each cell, there is a scaling 
function (j)j^i{x)^ and a wavelet function ipjj{x). 
These functions are the basis for the scale-space 
decomposition. The most important property of 
the DWT basis is its locality in both scale and 
physical spaces. The scaling function <j)j,i{x) is a 
window function on scale j and at position I. The 
wavelet function ipji{x) is admissible (Daubechies 
1992), i.e., / ipji{x)dx = 0, and therefore, it mea- 
sures the fluctuation on scale j and at position I. 

With the DWT, a transmission F{x) can be 
decomposed as (Fang & Thews 1998). 

2^-1 ,/ 2''-l 

1=0 j'=j 1=0 

(16) 

where J is given by the finest scale (resolution) of 
the sample, i.e., Az = L/2'^, and j is the scale of 

interest. The scaling function coefficient (SFC) ej^ 
in eq.(16) is given by projecting F{x) onto 4>j,i{x) 

el, = I F{x)cl>j,i{x)dx. (17) 

The SFC describes the mean (or smoothed) 
field of the mode {j, I). 

The wavelet function coefficient (WFC), e^i, in 
eq.(16) is obtained by projecting F{x) onto ipj,i{x) 



if, 



F{x)'i()j^i{x)dx. 



(18) 



The WFC is basically the difference between the 
smoothed flux F{x) in cells [xi + lL/2^ , xi-\- {1 + 
l/2)L/2i] and [xi + {l + l/2)L/2^ ,xi + {l + l)L/2i]. 
Therefore, the WFC ej^, can be used as the vari- 
able (5(a;-l-r)—(5(a;) in §2, where a; ~ xi+lL/2^ and 
r ~ L/2^ . All compactly supported wavelet bases 
produce similar results. We will use Daubechies 4 
in the study below. 

3.2. The intermittent exponent in the 
DWT basis 

We can express eq.(3) in the DWT representa- 
tion by replacing the density difference, 6{x + r) — 
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6{x), by the wavelet coefBcient eji as discussed in 
§3.1. Thus, 

= (i?,yn = ^Ei^xr' (19) 

1=0 

where j plays the same role as r in eq.(3). Sj"' 
is the mean of moment |eJ;P" over the position 
index I. With the DWT, eq.(4) becomes 

(20) 

In Paper I, Sj is used to define the DWT power 

spectrum P", i.e. P" = Sj. P" is the power spec- 
trum of the transmitted flux fluctuations AF = 
F — (F). In Paper I, or Jamkhedkar, Bi & Fang 
(2001), and Yang et al. (2001a), the power spec- 
trum Pj" differs from eq.(20) and (21) by a noise 
term. Since the noise is gaussian, this term can be 
ignored on scales for which Sj is larger than the 
variance of the noise. 

Eq.(20) can also be written as S"^ = i E?=o ^ Pfi^ 
where 

pri = & (21) 

where PjJ is the local power, i.e., the power on 
scale j at position I. 

Considering r ~ the intermittent expo- 

nent C [eq. (5)] can be calculated from 

5?" . , 



Generally, C depends on n and j- 
4. Samples 

4.1. Keck data of Lya forests 

The experimental data used in our study is the 
same as Paper I. It consists of 28 Keck HIRES 
QSO spectra (Kirkman & Tytler 1997). The QSO 
emission redshifts cover a redshift range from 2.19 
to 4.11. For each of the 28 QSO's, the data are 
given in terms of pixels with wavelength A,, fltix 
F(Aj) and noise a{\i). The noise includes the 
Poisson fluctuations and the noise due to the back- 
ground and the instrumentation. The continuum 
of each spectrum is given by IRAF CONTINUUM 
fitting. 



For our purposes, the useful wavelength region 
is from the Ly/3 emission to the Lya emission, ex- 
cluding a region of about 0.06 in redshift close to 
the quasar to avoid any proximity effects. In this 
wavelength range, the number of pixels is about 
1.2 X 10* for each spectrum. 

For all bins in this data set, the ratio AA/A is 
constant, AA/A ~ 13.8 x lO'^, or 6v ~ 4.01 km 
s~^, and therefore, the resolution is about 8 km 
s^^. The distance between A'' pixels in units of 
the local velocity scale is given by Av = 2c(l — 
exp [— (l/2)iV(5t;/c]) km s~^, or wavenumber fc = 
27r/At; km-^ s. 

We do a scale by scale decomposition of the 
data. We use only 2^^ = 8192 pixels for each spec- 
trum. Thus, each cell on scale j corresponds to 
iV = 2^"^"^ pixels. The smallest scales are gener- 
ally dominated by noise. We study only scales 
> 16 km s~^, corresponding to > 4 pixels or 
3 < 11, and k < 0.4 km"-*^ s. Since metal lines 
are generally narrow with Doppler parameter b< 
15 km s~^, ignoring scales less than 16 km s~^ 
also suppresses metal line contamination. The al- 
gorithm for treating unwanted data (pixels with 
negative flux or missing data) and detected metal 
lines with space-scale decomposition will be dis- 
cussed in more detail in §4.3. 

In applying our algorithm, we sometimes use 
the 28 QSO transmissions individually, i.e. calcu- 
late the statistics of the transmission over each 
QSO separately, and sometimes all the transmis- 
sions are treated together. In the latter case, 
we divide the data into 12 redshift ranges from 
z = 1.6 + nx 0.20 to 1.6 + {n + l)x 0.20 where 
n = 0, 1, ..11. All the transmission flux in a given 
redshift range forms an ensemble. Note that the 
number of data points in each redshift range is 
different. 

4.2. Simulation samples 

The simulated samples of the Lya forests are 

produced as in Paper I. Besides the LCDM model, 
we also simulate the warm dark matter (WDM) 
model, for which the linear power spectrum on 
scales smaller than the free-streaming length of the 
warm particle, Rf = 0.2{noh^y/^{mw/^eVr'^^^, 
is damped exponentially with respect to the pure 
CDM model. 

The WDM model is believed to be a possible 
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candidate to solve the problem of cuspy halos, or 
the singular mass profiles of massive objects. The 
problem appears in high resolution N-body simu- 
lations of CDM models where the models predict 
central cusps in the dark halos (Jing & Suto 2000). 
However, observations instead show soft halo pro- 
files as inferred from low surface brightness galax- 
ies and the rotation curves of dwarf galaxies (Flo- 
res & Primack 1994, Burkeret 1995). The WDM 
models arc able to soften the density profile of the 
central core, while at the same time having no 
effect on large scales. Hence, WDM models are 
proposed in order to deal with the nonlinear clus- 
tering on small scales. Since the cosmic field on 
those scales is already intermittent, it is important 
to examine the intermittent behavior of the WDM 
models. We consider four WDM models having 
particle masses mw = 300, 600, 800 and 1000 eV. 

4.3. Treatment of unwanted data 

In the Keck transmission flux, there are suspect 
data including bad pixels (gaps without data), and 
negative flux pixels. The latter are generally satu- 
rated absorption regions having lower S/N. Al- 
though the percentage of low S/N data is not 
large, it will introduce large uncertainties in the 
analysis. 

As in Paper I, we use the conditional-counting 
method to treat the unwanted data. Briefly, the 

algorithm is as follows: 

1. Calculate the SFCs for both the transmis- 
sion F{X) and noise cr(A), i.e. 
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= j F{x)(j)ji{x)dx, 4 = j (y{x)4>ji{x)dx. 

(23) 



</ 



(24) 



2. Identify unwanted mode [j, I) using the con- 
dition 

4 

where / is a constant. This condition flags 
all modes with S/N less than /. We can also 
flag modes dominated by metal lines. 

3. Since all the statistical quantities in the 
DWT representation are based on an aver- 
age over the modes (j, /)) we do not count 
all the flagged modes when computing these 
averages. 



Condition (24) is applied at each scale j. If the 
size of a bad data segment is d, condition (24) only 
flags modes [j, I) on scales less than or compara- 
ble to d. Therefore in this algorithm, no rejoining 
and smoothing of the data is needed. We also flag 
two modes around a unwanted mode to reduce any 
boundary effects of the chunks. With this method, 
we can still calculate the structure functions and 
intermittent exponent by eqs.(19), (20) and (22), 
but the average is not over all modes, but over the 
un-flagged modes only. Since the DWT calcula- 
tion assumes the sample is periodized, this may 
cause uncertainty at the boundary. To reduce this 
effect, we drop 5 modes near the boundary. 

5. Second and lower order statistical prop- 
erties 

5.1. The power spectrum 

The power spectra of the transmitted flux of 
the Keck data and simulated LCDM samples have 
been studied in detail in Paper I. The spatial dis- 
tribution of local power Pj| show spiky features. 
This property leads to large uncertainty of Pj*. 

Now we do a similar analysis for the WDM sam- 
ples. Fig. 1 plots the mean and error bars for P" 
for j = 10 (fc = 0.2 km-i s) of the 28 QSO's 
pseudo hydro simulation samples for the WDM 
model with mw = 300, 600, 800 and 1000 eV. For 
each QSO, we calculate the mean for each re- 
alization then calculate the mean over the 20 re- 
alizations. The error bar is given by the range 
of Pj"s by dropping the highest and lowest P^" of 
the 20 realizations. This is equivalent to dropping 
the top and bottom 5% of the data. As expected, 
the mean power PJ* is slightly lower for smaller 
mw- But the error bars of the power spectrum are 
generally much larger than the mvi^-dependence of 

j ■ 

In Fig. 2, we present the mean of the power 
spectrum in 8 rcdshift bins and on scales j = 8 
to 11 (fc = 0.05,0.1,0.2,0.4 km-i s) for the Keck 
data and all the simulated samples. On scale j = 8 
(fc = 0.05 km"-'^ s) and for all rcdshift bins, the 
powers decrease in the order from LCDM, to the 
WDM of mw = 1000, 800, 600, and 300 eV. This 
order is expected, as on scale j — S and rcdshift 
z > 2, the nonlinear and intermittent features are 
weak, and the power spectra trends are about the 
same as in linear regime. That is the power is 
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z z 

Fig. 1.— The power, Pf, on scale j = 10 (fc = 0.2 

km"-*^ s) of the 28 QSO's pseudo hydro simulated 
samples for the WDM models mw = 300, 600, 800 
and 1000 eV. The mean is given by average the 
of the power of the 20 realizations for each QSOs. 
The error bars are the dispersion of 18 realizations, 
which are given by dropping the highest and lowest 
P" of the 20 realizations. 
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Fig. 2. — The power spectrum vs. j for 
Keck data (vortex with three legs), LCDM (ver- 
tex with six legs) and WDM models of mw= 1000 
(hexagon), 800 (pentagon), 600 (square) and 300 
(triangle) cV. in rodshift bins 1.90 ±0.1, 2.3 ±0.1, 
2.5 ± 0.1, 2.7 ± 0.1, 2.9 ± 0.1, 3.1 ± 0.1, 3.3 ± 0.1 
and 3.7 ± 0.1. The wavenumber for scale j is 
fc = 0.4 X 2^-11 km-i s. 

smaller for smaller mw, and the LCDM corre- 
sponds to an infinite mass niw On scales j > 8 
{k > 0.05 km~^ s) the trend changes. This is 
because the nonlinear evolution is significant on 
these scales. The mean power is mainly depen- 
dent on rare and improbable events, and therefore, 
it induces a large uncertainty. 

Fig. 2 also shows that the power spectrum of 
the WDM model with mw = 300 eV is systemat- 
ically lower than the power of Keck data on al- 
most all scales and higher rcdshifts (z > 2.3). 
Therefore, this model can be ruled out by the 
power spectrum. On the other hand, there are 
no systematic differences between the power spec- 
tra of the real data, and the LCDM, or WDM 
{mw = 600, 800, lOOOeV) data. The dispersions 
among these power spectra generally are less than 
a factor of 2. Considering the error bars of the ob- 
served power spectrum (Paper I) and that shown 
in Fig. 1 are larger than by a factor of 2, the 
LCDM, mw = 1000,800 and 600 eV models are 
basically consistent with the Keck data. This 
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is one reason for proposing WDM models with 
mw > 600 eV. 



5.2. Cumulative distribution function (CDF) 
of local powers 

The PDF of the local power PjJ [eq.(21)] for 
a given j are found to be long tailed (Paper I). 
This property can also be shown by the cumu- 
lative distribution function (CDF), which is de- 
fined as the percentage of the modes with local 
power less than a given P^^/P". Fig. 3 presents 
the CDFs of P/J/P/ on scales j = 8,9,10 and 
11 {k = 0.05, 0.1, 0.2, 0.4 km-i s), and in redshift 
range 1.7 to 1.9. 

The CDFs generally consist of two parts: a 
rapidly growing branch at Pji/Pj' < 1, and very 
slowly growing branch at PfJ Pf ^ 1- The latter 
is given by spiky modes and the former from the 
passive modes between the spikes. 

\N = 



ji IS a X 

^ distribution 



For a gaussian field, the PDF of P, 
1) distribution. The CDF of the x 
is also shown in Fig. 3 (dotted line). This CDF 
approaches 1 at P^VP" ^ 10, i.e. ~ 3 tr. Fig. 3 
shows that the CDF of the Keck data approaches 
1 at PjJ/P/ ~ 15 (j = 8), 20 (j = 9), 30 (j = 10), 
and 60 (j = 11) and shows that the Keek's CDF on 
small scales has a much longer tail than a gaussian 
field. 

We also calculate the CDFs of the quasi-hydro 
simulation samples for the LCDM and WDM mod- 
els. The CDFs of simulation samples are generally 
in good agreement with the Keck data. Only the 
CDF of model mw = 300 eV on scale j = 8 shows 
a strong deviation from Keck data at Pfi/Pf > 5. 

On scales j = 10 and 11, the CDFs of the simu- 
lation samples show a little more rapid growth at 
P^"/P" < 1 than Keck data, and longer tail than 
the Keck data at P'J/P" > 10. The CDF of the 



60 



simulation samples approaches 1 at PV^/Pj 
{j = 10), and 100 (j = 11). This seems to indi- 
cate that the long tail behavior of the quasi-hydro 
simulation samples is more prominent than the ob- 
served samples. 

However, care should be taken about any con- 
clusion made with the CDFs. The rapidly growing 
branch, which is given by the quiet modes, is sen- 
sitive to noise. On the other hand, the horizontal 
branch is sensitive to spiky events, and therefore, 
depends on the number of modes of the sample 




20 40 60 80 
pu /pu 



50 100 
pu /pu 



Fig. 3.— The CDF of P^i/P" on scales j = 8, 9, 10 
and 11 for Keck data (solid line), LCDM (cross), 
and WDM models of mw= 1000 (hexagon), 800 
(pentagon), 600 (square) and 300 (triangle) eV in 
redshift bins 1.90 ± 0.1. The dotted lines are for 
gaussian field, i.e. the CDF of x^(-^ = 1) distribu- 
tion. The wavenumber for scale j isk = 0.4x2-'"^^ 
km~^ s. 
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considered. Therefore, from the CDFs of Fig. 3, 
one can only conclude that the LCDM and WDM 
with mw = 600, 800, and 1000 eV are consistent 
with the Keck data. 

5.3. Probability distribution functions of 
the SFCs 




Fig. 4.— The PDF of flux y^W/Lefi on scales j = 
12 for observed (histogram) and simulated LCDM 
samples (circular) of Q0130 on scales j = 12 (fc = 
0.8 km"-'^ s). The redshift of the transmitted flux 
is in the ranges z = 2.4 - 2.6. The error bars are 
given by bootstrap resampling. 

The PDF of the transmitted flux F is often used 
as a statistical measure of the Lya forests. The 
SFCs e^i defined by eq.(17) are proportional to 
the Lya transmitted flux at position /. The ej^ 
multiplied by ^/W/L is the mean transmitted flux 
in the cell Therefore, the PDF of y/2^/Lefi 

is the PDF of F smoothed on scale j . 

As an example, Fig. 4 presents the PDF of 
jLt^^i for the observed QSO, Q0131 on scales 
2 = 12. The error bars are given by bootstrap 
resampling. In Fig. 4, we also plot the PDFs 
of \/2Vlej^ on j = 12 {k = 0.8 km"! s) for a 
simulation sample of the LCDM model. It can 
be seen, even on this small scale, that the PDFs 
of the simulation sample basically are still con- 
sistent with observations. The biggest difference 



between the observation and model prediction is 
at y/ 2^ / Leji ~ 1. However, this difference is sen- 
sitive to noise. Fig. 5 gives the PDFs of yJ^T/Lt^i 
on j = lQ{k = 0.2 km-i s) for the LCDM simula- 
tion, but with gaussian noise of different cr added. 
The peak at ^J2i /Le^i ~ 1 is significantly depen- 
dent on the noise. Moreover, for the Keck sample, 
the noise level is correlated with F. Generally, 
noise is high at F ~ 1, and low at F ~ 0. This 
makes the peak of the PDF at ^J2i /Le^i ~ 1 more 
uncertain. 




Fig. 5.— The PDF of SFCs yW^ej; scales j = 
10 (fc = 0.2 km~^ s) for simulated LCDM samples 
for Q0130. The samples are added gaussian noise 
in each bin with standard deviation zero (solid), 
cr (dot), 2a (dash), and 4c7 (dot-dash), where a is 
the mean standard deviation for the keck samples. 

Statistics with SFCs ej^ [eq.(17)] generally are 
more sensitive to noise than WFCs ej"; [eq.(18)]. 
This is because ej^; is the density difference be- 
tween neighboring positions, and the uncertainty 
of the background noise on large scales is canceled 
in this difference. On the other hand, efj is the 
mean density at it is contaminated by the uncer- 
tainty of the background noise on all large scales. 
Thus, one may conclude that the PDFs of flux F 
is not reliable as a discriminator. 
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6. Structure functions and intermittent 

exponent of Lya forests 

6.1. Structure functions 

With cqs.(19) and (22), we calculate the struc- 
ture functions for the transmitted flux fluctuations 
of the Keck data. A typical result is plotted in 
Fig. 6, which is given by the Keck data in redshift 
range z = 2.50 ± 0.1 for all QSOs. The error bars 
are the maximum and minimum of bootstrap re- 
sampling. It is well known that the higher order 
correlation functions for large scale structure sam- 
ples have larger error than the 2-point correlation 
function. However, even for the 8th order statis- 
tic In 5^/(5*1)^, the error bars are not much larger 
than that of the power spectrum, i.e., the second 
order statistics P". The structure functions are a 
very stable statistical quantity because the struc- 
ture functions are a ratio between 5*1" and (Sj)", 
which reduces the effect caused by high spikes. 

Fig. 6 shows that, for a given n. In S'|"/(S'|)"' 
increases with scale j, and therefore, the intermit- 
tent exponent ^ is non-zero and negative [eq.(22)]. 
The distribution of the transmitted flux fluctua- 
tions are neither gaussian nor self-similar, but es- 
sentially intermittent. At other redshift ranges, 
the structure functions behave similarly. This con- 
clusion was already found with the local power dis- 
tribution (Paper I). However, the local power dis- 
tribution is based on the measures of each individ- 
ual mode, iji and has a large uncertainty. The dis- 
tribution of local power is not effective for model 
discrimination. On the other hand, the structure 
function is calculated by taking an average among 
modes. Statistically, they are more effective to test 
models. 

This point can be seen with Fig. 7, which plots 
the structure function In S'J"/(5'|)" of the trans- 
mitted flux fluctuations of the simulation samples 
for the LCDM and WDM models in redshift range 
z = 2.50 ± 0.1. The error bars are the maximum 
and minimum of bootstrap resampling. As a com- 
parison, the structure functions of the Keck data 
are shown in each panel too. We can see from Fig. 
7 that the structure functions of the LCDM and 
WDM models show intermittent behavior. How- 
ever, the j-dependence of the structure hmctions 
of either the LCDM or the WDM models is clearly 
different from the Keck data. For the simulation 
samples, the slope is steeper than the Keck data. 
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Fig. 6.— log2[S'|"/(S'2)"] vs. j with n=2, 3, 4 for 
Keck samples in the redshift range z = 2.5± 0.1. 
The error bars are given by the maximum and 
minimum of bootstrap resampling. The wavenum- 
ber for scale j is k = 0.4 x 2^~^^ km~^ s. 

This result shows that the intermittent behavior 
cannot be measured by any individual high order 
moment or correlation function, but must be mea- 
sured by the scale- and n-dependencies of the ratio 
between the moments. 

It is also interesting to point out that the struc- 
ture functions on small scales j = 9, 10, and 11 
{k = 0.1, 0.2 and 0.4 km~^ s) are larger for smaller 
mass mw- The structure functions of mw = 300 
eV are always the largest one compared to the 
other models. This is because the WDM model 
with smaller mass mw, or longer free-streaming 
length Rf, lacks power on small scales. The den- 
sity fluctuations on those small scales mainly orig- 
inate from the nonlinear process of transferring 
power from large to small scales (Suto & Sasaki 
1991). On the other hand, the structure func- 
tion In5|"/(S'|)" is the 2n-th moment 5^" nor- 
malized by the power S"^, and hence, essentially 
measures the fraction of density fluctuations which 
undergoes a nonlinear evolution due to transfer of 
power. Therefore, smaller mw leads to a stronger 
intermittency on scales less than Rf. 

Figs. 6 and 7 show that the relations of 
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Fig. 7.— log2[6'2"/(S'2)"] vs. j for the mod- 
els of the LCDM (star) and WDM mw= 1000 
(hexagon), 800 (pentagon), 600 (square) and 300 
(triangle) eV in the redshift range z = 2.5 ± 0.1, 
and n = 2. 3 and 4. For comparison, the result 
for Keck data(vertex with three legs) is also plot- 
ted. The error bars are given by the maximum 
and minimum of bootstrap resampling. For clar- 
ity, the points for simulation samples are shifted 
horizontally to the right from the corresponding 
j. The wavenumber for scale j is k = 0.4 x 2^~^^ 
km~^ s. 

ln5|"/(5?)" vs. j for all models and Keck data 
can be very well fitted by a line. That is, the 
intermittent exponent C, which is the slope of the 
fitting line, is a constant in the scale range from 
i = 5 to 11 [eq.(22)], i.e. from k = 0.006 to 0.4 
km~^ s. 

6.2. n-dependence of the intermittent ex- 
ponent 

We now turn to the n-dependence of the struc- 
ture function and intermittent exponent. Prom 
eq.(22), we have 

1 S'^" 

C{n) = - - ln2 j^r- + const, (25) 
J [^j ) 

or 

C(n)-C(l) = -jln2^, (26) 



For a given j, the n-dependence of ({n) is given 
by Inz^f 7(^2)" vs. n. 

Fig. 8 presents ln2 -Sj"/ (Sf)" vs. n for the Keck 
data in the redshift range z ~ 2.50 ± 0.1. For a 
gaussian field, the n-dependence of ln2 S'|"/(5|)" 
is given by eq.(7), i.e. In2(2n — 1)!!, which is also 
plotted in Fig. 8. The figure shows that the differ- 
ence between the Keck data and a gaussian field 
is greatest at small scales. 

More interesting is to fit the observed n- 
dependence of Ins S'?"/(S'?)" with 

ln2 ^ oc n"(n - 1). (27) 

The motivation is simple, as a = corresponds 
to hierarchical clustering [eq.(12)], and a = 1 to 
a lognormal field [eq.(15)]. Fig. 8 shows that the 
best fit of a for the Keck data is about 0.3 on scales 
j = 8 and 9 (fc = 0.05 and 0.1 km^^ s), but a ~ 0.5 
or higher on scales j = 10 and 11 (fc = 0.2 and 0.4 
km~^ s). This indicates that the transmitted flux 
is closer to a lognormal field on small scales. In 
other words, on scales for which the intermittency 
has been fully developed, the transmitted flux field 
can be modeled by a lognormal field. This may be 
the reason that lognormal models of Lya forests 
match very well with observations not only at sec- 
ond and lower order statistics of Lya forests Bi & 
Davidsen, 1997), but also with higher order behav- 
ior, like the scale-scale correlations (Feng & Fang 
2000). 

Figs. 9 and 10 are, respectively, the ln2 Sj"/ (Sj) 
vs. n for the LCDM model and the WDM models 
with mw — 300 eV. They are significantly dif- 
ferent from a gaussian field. This is consistent 
with the CDFs results (§4.2). The best fit for a 
is always about 0.2, regardless of scale. For the 
WDM models with other mass niw, the results are 
similar to Figs. 9 and 10. That is, the intermit- 
tency of the simulation samples is approximately 
that of hierarchical clustering, i.e., a mono- fractal 
distribution. 

Thus, one may conclude that the intermittency 
of the simulation samples is different from those 
of the Keck data not only quantitatively, but also 
qualitative. The former is scale-independent, and 
close to a hierarchical clustering, while the later is 
close to a lognormal field on small scales. 



12 



Fig. 8.— log2[S'?"/(S'|)"] vs. n for j = 8,9,10 
and 11 for the Keck data in the redshift range 
z = 2.50 ±0.1. The error bars are given by boot- 
strap resampling. The fitting curves are n"(n — 
1). The dotted curves are for gaussian field, i.e. 
log2['S|"/(S'|)"] = fog2(2n - 1)!!. The wavenum- 
ber for scale j is fc = 0.4 x 2^~^^ km~^ s. 

6.3. Redshift-dependence of intermittent 
exponent 

In the last two subsections, only redshift z = 2.4 
to 2.6 is considered. For other redshift bins, the 
result is about the same as z = 2.4 to 2.6. The 
against redshift z for n = 2, 3 and 4 is shown in 
Fig. 11. The error bars are from the least squares 
fitting [eq.(25)]. 

In all redshift ranges, the value of |C| for the 
Keck data is found to be substantially and system- 
atically lower than that of the LCDM and WDM 
models. This can directly be seen from with Fig. 
7. This is also consistent with Figs. 8-10. It is 
interesting that either for the Keck data or the 
models, the intermittent exponents are almost in- 
dependent of redshifts in the range z = 2 to 4. 
This is very different from both the observation 
and theory of massive halos. Collapsed halos with 
mass on the order of galaxies and clusters undergo 
a significant evolution in the redshift range from 
4 to 2. 

The difference between the statistics of the 



Fig. 9. log2[5'|"/(S'|)"] vs. n for j = 8,9,10 
and 11 for the simulation samples of the LCDM 
in the redshift range z = 2.50 ± 0.1. The error 
bars are given by the maximum and minimum 
of bootstrap resampling. The fitting curves are 
n"(n — 1). The dotted curves are for gaussian 
field, i.e. log2[5?"/(6'?)"] = log2(2n - 1)!!. The 
wavenumber for scale j is k = 0.4 x 2^~^^ km~^ s. 

Lya forests and massive halos is probably due to 
the fact that the QSO's transmitted flux is given 
mainly by the absorption of baryonic matter out- 
side of the collapsed massive halos, i.e., the weakly 
clustered area. Collapsed halos correspond to the 
saturated absorption in the transmitted flux for 
which the S/N generally is low. Therefore, the 
Lya forest does not contain information on the 
details of the massive halos. 

Fig. 11 also shows that the intermittent ex- 
ponents of the LCDM and WDM models with 
mw = 300 eV are about the same. This is also 
different from massive halos, for which the LCDM 
and WDM models predict different mass density 
profiles and number of substructures. 

We have simulated the intermittent formation 
at high redshift. The results show that in the area 
outside of the collapsed halos, the intermittency is 
already well developed at redshift z ~ 5, and does 
not increase much for z < 5. Therefore, intermit- 
tency is probably the earliest developed nonlinear 
feature formed during the evolution from the lin- 
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Fig. 10.— log2[S'|"/(5|)"] vs. n for j = 8,9,10 
and 11 for the simulation samples of the WDM 
niw = 300 eV. in the redshift range z = 2.50 ±0.1. 
The error bars are given by bootstrap resampling. 
The fitting curves arc n"(n— 1). The dotted curves 
are for gaussian field, i.e. \og2[Sj"-/{Sj)"'] = 
log2(2n — 1)!!. The wavenumber for scale j is 
fc = 0.4 X 2^-11 km-i s. 

ear to the nonlinear regimes. 

6.4. The effect of noise 

In the above calculations, we generally take the 
parameter / = 1, i.e., drop modes with S/N < 1. 
The effect of noise on the statistical result can be 
estimated by the /-dependence, as large / is equal 
to adding large noise to the simulation samples, 
but still taking / = 1. In Paper I, we showed that 
the power spectrum P" is almost /-independent 
when / = 1 to 5. 

Fig. 12 shows In5?"/(S'2)" vs. j for the LCDM 
model when the noise and / are treated as follows: 
1.) sample without adding noise, and / = 0; 2.) 
sample without adding noise and / = 3; 3.) sam- 
ple with adding noise and / = 3. Fig. 12 shows 
that the effect of noise and / should be consid- 
ered only on the smaller scale j = 11, and can be 
ignored on all other scales. 

Fig. 13 is similar to Fig. 10, but with added 
noise and taking / = 3. Similar to Fig. 12, the 
effect of noise and / on a should be considered 



Fig. 11. — C„ vs. z, with n=2, 3, 4 for the Keck 
data and the LCDM (a), and WDM with mw = 
300 eV. The error bars are from the least squares 
fitting. 

only on the smallest scale j = 11 and gives a of 
about 0.45. Even in this case, the simulation sam- 
ples are still significantly different from the Keck 
data. Therefore, one can conclude that all results 
of §6.1 to 6.3 are not very sensitive to noise. 

7. Discussions and conclusions 

7.1. Model-discriminator of nonlinear regime 

In the Paper 1. we showed that in the nonlinear 
regime, the cosmic mass field is intermittent and 
the power spectrum will on longer be a critical 
discriminator among models of structure forma- 
tion. In this paper, we show that for an intermit- 
tent field, the structure functions and intermittent 
exponent are the critical discriminators for mod- 
els of structure formation. This discriminator can 
be employed not only to distinguish gaussian and 
non-gaussian fields, but to detect the type of the 
non-gaussianity. With the n- j- and z-dependence 
of the structure functions and the intermittent ex- 
ponent, we are able to distinguish between various 
nonlinearly evolved fields in detail. That is, the in- 
termittent exponent provides both qualitative and 
quantitative measures of the cosmic mass and ve- 
locity fields from the linear to nonlinear regimes 
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WDM 300 eV 



Fig. 12 — log2 S'|"/(S'^)" vs. j for simulation 

sample of the LCDM. The noise and / are treated 
by the following ways: 1.) sample without adding 
noise, and / = (triangle), 2.) sample without 
adding noise and / = 3 (square), 3.) sample with 
adding noise and / = 3 (pentagon). For clarity, 
the points for 2.) and 3.) are shifted horizontally 
to right from the corresponding j. The wavenum- 
ber for scale j is k = 0.4 x 2^~^^ km~^ s. 

and from large to small scales. 

We show that the distribution of the transmit- 
ted flux fluctuations of the QSO's Lya forests is 
intermittent and closer to a lognormal field on 
small scales. Obviously, this conclusion is impor- 
tant in order to understand the dynamics of the 
clustering evolution of baryonic matter. It will 
be interesting to study when this feature formed, 
and whether the intermittency on scales less than 
j = 11 (fc = 0.4 km~^ s) is closer to a lognormal 
field. These problems can be studied with sam- 
ples at higher redshift and higher S/N than the 
one used in this work. 

Using intermittent features, even "weakly" 
clustered samples, such as QSO's Lya absorption 
spectrum, can play an important role for test- 
ing structure formation dynamics in the nonlinear 
regime. It is also important to detect the intermit- 
tency of galaxy distributions, as the transmitted 
flux of Lya cannot provide information of highly 
collapsed regions in the cosmic mass field. Note 
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Fig. 13. — The same as Fig. 10, but the sample is 
added with noise and / is taken to be 3. 

that from the definition of the structure function 

eq.(l), the intermittent exponent is bias-free if the 
galaxy bias is linear, i.e. p galaxy = bpdark matter- 

7.2. Problems with the LCDM and WDM 

samples 

Although the pseudo-hydro simulation samples 
for the LCDM and WDM models are good in 
agreement with the Keck data in terms of the 
power spectrum, PDF and CDF of the transmis- 
sion flux fluctuations, they are significantly and 
systematically inconsistent with the intermittent 
features of the Keck data. Specifically, 1.) the 
structure functions of the simulation samples are 
larger than that of Keck data on scales less than 
k = 0.1 km^^ s; 2.) the intermittent exponents 
of the simulation samples are more negative than 
that of Keck data on all redshifts considered; 3.) 
the n-dependence of the intermittent exponent of 
simulation samples are close to the intermittency 
of hierarchical clustering on all scales, while the 
Keck data are close to a lognormal field on small 
scales. That is, the model-predicted intermittency 
is quantitatively and qualitatively different from 
the observed results. 

This is probably the first result to reveal the de- 
viation of the popular dark model LCDM model 
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from the Lya forest observations. However, we 
sliould be very careful in reacliing conclusions rul- 
ing out the relevant dark matter models. For a 
linear gaussian field, all statistics can be deter- 
mined by power spectrum with given dark matter 
parameters. However, intermittency arises from 
the nonlinear evolution and depends not only on 
the cosmological parameters, but also on 1.) dy- 
namical assumptions of the relation between the 
IGM and underlying dark matter field and 2.) pa- 
rameters used for simulation. 

For instance, the lognormal model of Lya 
forests (Bi & Davidsen 1997, Feng & Fang 2000) 
assumes phenomenologically that the nonlinear 
field of baryonic matter is given by a lognormal 
transformation of the underlying mass field in lin- 
ear regime. The sample given by this dynamical 
assumption can certainly fit with a lognormal PDF 
of the Lya transmitted flux. 

In the pseudo hydro simulation, the density of 
baryonic matter in each pixel is assigned accord- 
ing to the dark matter density at that pixel. This 
is equal to phenomenologically assuming that the 
statistical property of the baryonic matter is deter- 
mined by the underlying mass field. This assump- 
tion is reasonable in terms of second and lower 
order statistics. However, it may not be correct 
in the context of intermittency. That is, although 
baryonic matter can be considered as a passive 
component in the system consisting of dark mat- 
ter and baryonic matter, the statistical properties 
of the passive component can decouple from the 
underlying mass field during the nonlinear evolu- 
tion (Shraiman & Siggia 2000). For instance, a dif- 
fused passive substance can exhibit intermittency, 
even when the underlying mass field is gaussian 
(Kraichnan, 1994). This is due to the nonlinear 
evolution of this two component system. 

Moreover, the parameters of simulation may 
also cause uncertainty in measuring the intermit- 
tency. In the paper I, we studied the effect of the 
sample size on the power spectrum. When the 
characteristic spacing between high spikes (long 
tail events) exceed the spatial size of samples, the 
spatial averages cease to coincide with ensemble 
averages. As a consequence, the mean power of the 
transmission fluctuations is concentrated in rare 
but large spikes while the other modes have low 
power, or are even inactive. We showed that on 
scales less than k ~ 0.10 km~^ s, 50% or more 



of the power is contributed by the top 5% modes 
for unnormalized power spectrum, and 1% modes 
for the locally-normalized power spectrum. There- 
fore, before making conclusions about the dark 
matter parameters, we should study whether the 
deviation is caused by the dynamical assumptions 
on the relation between the IGM and dark mat- 
ter. We should also study the effects of simulation 
parameters (size, resolution etc) on intermittency. 

Nevertheless, the current result is strong enough 
to conclude that all theory and models on the dy- 
namics of nonlinear evolution of the cosmic mass 
and velocity fields must be examined with their 
predictions of intermittency. For models which 
are almost degenerate in the linear regime, the 
tests with intermittency in nonlinear regime are 
crucial. 

Finally, we should mention that the dynamical 
evolution of cosmic clustering was found to be able 
to sketch by the stochastic-force-driven Burgers' 
equation (Berera & Fang 1994; Jones 1999), or the 
so-called KPZ equation (Kardar, Parisi & Zhang 
1986; Barabasi & Stanley 1995). These equations 
are typical of dynamical models which lead to in- 
termittency (e.g. Polyakov, 1995; E, et al 1997; 
Balkovsky et al. 1997). These examples clearly 
illustrate that intermittency is a basic property of 
the nonlinearly evolved cosmic mass field. Hence, 
we strongly believe that intermittency is a win- 
dow to the dynamical evolution of cosmic nonlin- 
ear clustering. Knowledge of intermittency may 
ultimately lead to a better understanding of the 
still intractable problem of galaxy formation. 
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